Estimation of Generalization Error: Random and Fixed Inputs
Authors
Abstract
In multicategory classification, an estimated generalization error is often used to quantify a classifier’s generalization ability. The quality of this estimate is therefore crucial when tuning and combining classifiers. This article proposes an estimation methodology for the generalization error that treats both fixed and random inputs, in contrast to the conditional classification error commonly used in the statistics literature. In particular, we derive a novel data perturbation technique that jointly perturbs inputs and outputs to estimate the generalization error. We show that the proposed technique yields optimal tuning and combination, as measured by generalization. We also demonstrate via simulation that it outperforms cross-validation for both fixed and random designs in the context of margin classification. The results support the utility of the proposed methodology.
Similar resources
An Additive Model for Estimation Return to Scale in Regulated Environment with Quasi-Fixed Inputs in Data Envelopment Analysis (DEA)
Measuring returns to scale (RTS) quantifies the relationship between inputs and outputs in a production structure. There are many ways to calculate RTS in the primal or dual space. In more realistic cases, however, governments usually intervene in the behavior of DMUs as regulatory agencies, which clearly imposes a set of limitations and restrictions on that behavior, so very few decisions in DMUs are ...
Nonparametric and Semiparametric Estimation of the Automobile Industry Production Function with Emphasis on the Energy Input: Introducing the Olley-Pakes (OP) Method for Estimating Panel Data Models
Unobservable productivity shocks cause selection and simultaneity problems in firms’ decisions, and these problems bias estimators such as ordinary least squares for the coefficients of production function inputs. In this study, data from five automaker companies over the period 1383-1387 are used, and the production function of the car industry is estimated by ordinary l...
Random fixed point of Meir-Keeler contraction mappings and its application
In this paper we introduce a generalization of the Meir-Keeler contraction for a random mapping T : Ω×C → C, where C is a nonempty subset of a Banach space X and (Ω,Σ) is a measurable space with Σ being a sigma-algebra of subsets of Ω. We also apply these random fixed point results to prove the existence and uniqueness of a solution for a special random integral equation.
Estimation of Variance Components for Body Weight of Moghani Sheep Using B-Spline Random Regression Models
The aim of the present study was to estimate (co)variance components and genetic parameters for body weight of Moghani sheep, using random regression models based on B-spline functions. The data set included 9165 body weight records from 60 to 360 days of age from 2811 Moghani sheep, collected between 1994 and 2013 from Jafar-Abad Animal Research and Breeding Institute, Ardabil province,...
Modeling of measurement error in refractive index determination of fuel cell using neural network and genetic algorithm
Abstract: In this paper, a method is presented for determining the refractive index of a fuel cell membrane, based on a three-longitudinal-mode laser heterodyne interferometer. The optical path difference between the target and reference paths is fixed, and the phase shift is then calculated in terms of the refractive index shift. The measurement accuracy of this system is limited by nonlinearity erro...